Profiling Techniques for a Fine-Grained Parallel Language
نویسندگان
چکیده
Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the fine-grained parallel programming language Split-C, which provides a simple global address space memory model. One profiler provides a detailed analysis of a program’s execution. The other profiler collects cumulative information. As our experience shows, it is much more challenging to profile programs that make use of efficient, low-overhead communication. We incorporated techniques which minimize profiling effects on the running program. We quantify the profiling overhead and present several Split-C applications which show that the profiler is useful in determining performance bottlenecks.
منابع مشابه
An Approach for Fine-Grained Profiling of Mesh-Based Parallel Programs
Characterizing the dynamic behavior of parallel programs in terms of their execution profile helps to understand their behavior and suggest optimization strategies to improve the performance. Traditional event tracing techniques write the profiled data to trace files. Using the traditional approach for fine grained profiling not only yields large unwieldy trace files but often also gives skewed...
متن کاملAn improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کاملLoad Balancing in the Fine-Grained Object-Oriented Language Ellie
This position paper presents the main goals of the Ellie project which have been to design and implement a general, powerful, fine-grained, object-oriented, programming language intended for machine independent programming of distributed memory parallel computers. The implementation includes operating system facilities for message routing and load balancing. The main subject of this paper is lo...
متن کاملSynchronization coherence: A transparent hardware mechanism for cache coherence and fine-grained synchronization
The quest to improve performance forces designers to explore finer-grained multiprocessor machines. Ever increasing chip densities based on CMOS improvements fuel research in highly parallel chip multiprocessors with 100s of processing elements. With such increasing levels of parallelism, synchronization is set to become a major performance bottleneck and efficient support for synchronization a...
متن کاملEvolution of texture in an ultrafine and nano grained magnesium alloy
The evolution of texture was discussed during the formation of ultra-fine and nano grains in a magnesium alloy severely deformed through accumulative back extrusion (ABE). The microstructure and texture obtained after applying multiple deformation passes at temperatures of 100 and 250°C were characterized. The results showed that after single ABE pass at 100°C an ultrafine/nano grained microstr...
متن کامل